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^^ ' Abstract. We study the generic properties of finitely presented monoids 

and semigroups. We show that for positive integers a > 1, k and m, 
the generic a-generator fe-relation monoid and semigroup (defined in any 
of several definite statistical senses) satisfy the small overlap condition 
C{m). It follows that the generic monoid is torsion-free and ^-trivial 
and, by a recent result of the author, admits a linear time solution to 
its word problem and a regular language of unique normal forms for its 

,^^ , elements. Moreover, the uniform word problem for finitely presented 

■ ■ monoids is generically solvable in time linear in the word lengths and 

quadratic in the presentation size. We also prove some technical results 
r^ . about generic sets which may be of independent interest. 

1. Introduction 

Traditional complexity theory studies the time taken to solve a problem 
^ . or execute an algorithm in the "worst case", but for many problems the 

CD ! "worst case" arises very infrequently. Probably the best known example is 

^^ ' Dantzig's simplex method for linear programming [2] , which has exponential 

worst case time complexity but in practice almost invariably terminates in 
linear time (see eg. [H]). Now over 60 years old, it remains the preferred 
^— N ■ choice for practical applications, even though there are now alternative al- 

00 ! gorithms with worst-case polynomial time complexity. Phenomena such as 

^^ ' this motivated the development of average-case complexity [6], which mea- 

sures, roughly speaking, the mean difficulty of a problem across instances, 
with respect to some measure. Average-case complexity has proved ex- 
;^ ■ tremely helpful for obtaining a theoretical understanding of the "practical" 

5^ , difficulty of problems, especially within the class NP of problems admitting 

non-deterministic worst-case polynomial time solution. 

Average-case analysis can also be applied outside NP, but it meets with a 
conceptual difficulty. For most applications, what matters is not so much the 
m,ean difficulty of a problem of across instances, but rather the typical diffi- 
culty of instances encountered in practice. As is well-known to statisticians, 
the mean value of a data set is not necessarily a guide to the typical val- 
ues, since the former can be heavily skewed in one direction by a very small 
number of very extreme outliers. Likewise, the average-case complexity of a 
problem can be skewed upwards by a very small proportion of very difficult 
instances. Within NP worst cases are single exponential; this imposes a limit 
on the "extremeness" of outlying instances and hence their ability to distort 
the mean. Outside NP, however, the distortion can be much more dramatic, 
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2 GENERIC MONOIDS AND SEMIGROUPS 

with a tiny minority of extremely difficult instances potentially inflating the 
average-case complexity well beyond the complexity of the typical instance 
encountered in practice. This culminates in the extreme case of recursively 
unsolvable problems, whose average-case complexity is not defined at all, 
even though algorithms may exist to solve such problems efficiently for an 
overwhelming majority of cases [3]. 

The aim of generic- case complexity is directly to analyse the complexity of 
typical problem instances, as distinct from the average difficulty of problem 
instances. Rather than introducing a measure on the instance space, the 
key idea is the stratification of an instance space (or indeed any other set) 
into an infinite sequence of finite subsets. A subset X of the space is called 
generic if the proportion of elements in each finite set which belong to X 
approaches 1 as one moves along the sequence. The generic complexity is 
(very roughly speaking) the minimum complexity attainable on a generic set. 
Compared with the average-case approach, the key feature is that no single 
instance (indeed no finite set of instances), makes any contribution at all to 
the generic properties of the space. Generic-case complexity was introduced 
by group theorists [10], investigating the large stock of hard algorithmic 
problems which occur in the study of finitely generated infinite groups. It 
has proved especially useful in view of recent interest in the use of non- 
commutative groups as a basis for cryptographic systems pOJ, permitting 
for example a theoretical understanding of the success of the length-based 
attack [n\ on the Shpilrain-Ushakov key establishment protocol based on 
the Thompson group [19] . 

The main aim of this paper is to study the generic properties of finitely 
presented monoids and semigroups, and hence to understand the generic- 
case complexity of uniform decision problems for monoids and semigroups. 
Our main results show that, with respect to a number of very natural strat- 
ifications, the genericy finite monoid presentation (over a given alphabet 
and with a given number of generators) satisfies small overlap conditions 
in the sense introduced by Remmers [151 [T6] (see also [7j). Small overlap 
conditions are natural semigroup-theoretic analogues of the small overlap 
conditions extensively used by combinatorial group theorists, and so our 
main result can be viewed as loosely analogous (although our objectives 
and hence our formalism are rather different) to the well-known fact, first 
asserted by Gromov [5] and proved in detail by Ol'shanskii [13], that the 
generic finitely presented group is word hyperbolic. 

These results immediately tell us a great deal about the algebraic struc- 
ture of the generic monoid. For example, we learn that it is ^ -trivial, and 
hence torsion-free with no non-trivial subgroups. Even more important, by 
recent results of the author |8j, the uniform word problem for such pre- 
sentations is solvable in (worst-case) time linear in the words lengths and 
quadratic in the presentation size. Since it can be checked in (worst-case) 
quadratic time whether a presentation satisfies a small overlap condition. 



For brevity, we use statements such as "the generic X has property Y" as shorthand 
for "there is a generic subset of the set of X's, every member of which has property Y" . Of 
course the generic X truly "exists" only in the case that a single isomorphism type forms 
a generic subset oi X's; in this case the isomorphism type has all the ascribed properties, 
so the terminology is unambiguous! 
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it follows that the uniform word problem for finitely presented monoids 
is generically solvable in (worst-case) time linear in the word lengths and 
quadratic in the presentation size. All of these results apply equally to 
semigroups without identity elements. 

As already remarked, generic-case complexity has been developed by 
group theorists, and the literature is largely concerned with applications 
to advanced group theory; as a result, much of it is not readily accessible 
to non-algebraists. An additional objective of this article is to provide a 
gentle (although by no means comprehensive) introduction to generic sets 
and properties, and generic-case complexity, in a form fully intelligible to 
the reader without a specialist algebraic background. Monoid presentations 
are combinatorially simpler objects than group presentations, and most of 
our proofs are of an elementary combinatorial nature which should allow 
them double as detailed worked examples to give the reader a feel for the 
theory of generic-case complexity. The few places where we resort to more 
advanced algebraic notions are clearly delimited and self-contained, so that 
the bulk of the article can be understood without following these parts in 
detail. 

In addition to this introduction, this article comprises four sections. Sec- 
tion [5] provides a gentle introduction to generic sets and generic-case com- 
plexity. In Section El we prove our main results about generic monoids and 
semigroups with respect to certain stratifications. In Section U] we prove 
some technical results regarding the relationships between different strat- 
ifications; these may be of some independent interest; these are applied 
to show that our results about generic monoids apply regardless of which 
of several natural stratifications is chosen. Finally, Section [5] explores the 
consequences of our characterisations of generic monoids and semigroups, 
including the fact that the uniform word problems for finitely presented 
monoids and semigroups are generically solvable in time quadratic in the 
presentation lengths and linear in the word lengths. 



2. Generic Properties and Generic-case Complexity 

In this section we provide a brief introduction to generic sets and generic 
complexity. A more comprehensive treatment can be found in [1]. Our aim 
is to make the paper accessible to as wide an audience as possible, and so 
we endeavour to keep mathematical prerequisites to a minimum. However, 
we cannot avoid assuming some elementary familiarity with the theory of 
sets and sequences. 

Let 5 be a countably infinite set. A stratification of S is an infinite 
sequence 81,82, ■■■, Sn, ■■ ■ of finite subsets of 8 whose union is 8. The 
computationally-orientated reader may like to bear in mind the example 
where 8 is the instance space for some problem, and 8n is the set of instances 
of size n for some suitable notion of size; however, we caution that in general 
the subsets 8n need not be disjoint. We call the stratification spherical if 
the sets 8n are pairwise disjoint {8i n S'j = for all i 7^ j) and at the other 
extreme ascending if they form an ascending sequence under containment 
{8i C 8j for all i < j). 
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Now let X be a subset of S. We say that X is generic (with respect to 
the given stratification) if 

|xn5„| . . 

The subset X is cahed negligible if S \X is generic, or equivalently, if the 
hmit on the left-hand-side of ([T]) is defined and equal to 0. Intuitively, X 
is generic if the probability that an instance of size n, chosen uniformly 
at random, lies in X can be made arbitrarily close to 1 by choosing large 
enough n. 

Note that, for any given set X, the limit on the left-hand-side of ([T]) may 
not be defined, and indeed for almost any stratification it is easy to construct 
a set X for which it is not. The function 

X I— > lim 



I "? I 

is a finitely additive probability measure defined on those subsets of X for 
which the limit converges, but it is typically not a measure in the usual 
sense, since it lacks countable additivity. This fact is no accident: a count- 
ably additive measure on a countable set cannot assign 0-measure to all the 
singletons, but as we noted in the introduction, a key feature of the generic 
approach is that single instances are regarded as negligible. Nonetheless, 
the intuition that the generic sets are those of "full measure" can be helpful, 
and they satisfy many of the elementary properties of such sets. In partic- 
ular, the reader can easily verify that if X is generic and X CY then Y is 
generic, while if X and Y are both generic then X nY is generic. Obvious 
dual statements hold for negligible sets. 

Notice that, in our initial definition of generic sets, we have placed no re- 
quirements on the rate of convergence of the left-hand-side of ([T]) . Genericity 
is an asymptotic property, and if convergence is very slow then the asymp- 
totic behaviour may not be reflected in "practical sized" instances. We call a 
set X superpolynomially generic/negligible if the appropriate limit converges 
faster than 1/n^ for every p G N, and exponentially generic/negligible if it 
converges faster than j?" for for some p € (0, 1]. (In the literature some au- 
thors use the term "strongly generic" for what we have called exponentially 
generic sets, while some use "strongly generic" to mean superpolynomially 
generic and "super generic" to mean exponentially generic. To avoid confu- 
sion, we shall avoid these terms in favour of less concise but more descriptive 
ones.) 

We now turn our attention to the application of generic sets in computa- 
tional complexity. This requires us to consider explicitly not just abstract 
algorithmic problems, but also also stratiflcations of instance spaces. We 
define a stratified problem to be an algorithmic decision problem equipped 
with a stratification on its instance space. (We shall restrict our attention 
here to decision problems, but analogous definitions can be made for more 
general computational problems.) 

Of course traditional complexity theory is implicitly concerned with strat- 
ified problems: to study the asymptotic complexity of a problem one requires 
a notion of the size of each member of the instance space S. As we have 



GENERIC MONOIDS AND SEMIGROUPS 5 

already remarked, this automatically induces a stratification given by set- 
ting Sn to be the set of all instances of size n. We call this the input size 
stratification for the problem. However, the dependence on stratification is 
much tighter in generic complexity theory than it is in traditional complex- 
ity theory - many authors discussing traditional complexity of algorithmic 
problems prefer to avoid detailed discussion of data encoding and hence of 
exact instance sizes; this is entirely reasonable since traditional complexity 
classes are largely insensitive to minor encoding issues. But for generic-case 
complexity, these issues can make a very big difference. 

Note also that, while the input size stratification is a natural, canonical 
one to associate to any algorithmic problem, it is only one of many possible 
stratifications, and may not be the appropriate one for any given applica- 
tion. The ideal is rather to find a stratification which reflects the empirical 
distribution of problem instances, that is, the frequency with which they 
arise in practice in a particular application, and there is often no reason to 
suppose that this is strongly correlated with size. 

Now let C be any class of decision problems (typically a complexity class 
of some kind). We say that a stratified problem ^ is generically in C if 
there exists a generic subset Y of the instance space such that 



(i) the membership problem for Y lies in C; and 
(ii) the problem ^ restricted to Y lies in C. 



Intuitively, a stratified decision problem is generically in C if the decision 
problem admits a partial algorithm (that is, an algorithm which outputs 
"yes", "no" or "don't know", and which in the former two cases is always 
correct) in C, such that the probability of a "don't know" is negligible. We 
write GenC for the class of all stratified problems generically in C. 

Obvious examples are the class GenP of generically polynomial-time 
stratified problems and GenNP of generically non- deterministic polynomial- 
time stratified problems. Another interesting example is the class GenBPP, 
which consists of stratified problems admitting a randomised polynomial- 
time algorithm with probabability of error uniformly bounded away from 
1/2 for every instance in some generic subset whose membership problem 
also lies in BPP. 



3. Generic Monoid Presentations 

In this section we study the generic properties of finite monoid presenta- 
tions. We begin with some basic definitions. 

Let A be a finite alphabet (set of symbols). A word over ^ is a finite 
sequence of zero or more elements from A. The set of all words over A is 
denoted A*; under the operation of concatenation it forms a monoid, called 
the free monoid on A. The length of a word w £ A* is denoted \w\. The 
unique empty word of length is denoted e; it forms the identity element of 
the monoid A*. The set A"*" \ {e} of non-empty words forms a subsemigroup 
of A*, called the free semigroup on A. 
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A finite monoid presentation {A \ R) consists of a finite alphabet A, 
togetlier with a finite sequence R C A* x A* of ordered pairs of wordqj. We 
say that u,v (^ A* are one-step equivalent if u = axb and v = ayb for some 
possibly empty words a,b & A* and relation (x,y) £ R or (y, x) G R. We 
say that u and v are equivalent, and write u =r v or just u = v, if there is 
a finite sequence of words beginning with u and ending with v, each term 
of which but the last is one-step equivalent to its successor. Equivalence 
is clearly an equivalence relation; in fact it is the least equivalence relation 
containing R and compatible with the multiplication on R. The equivalence 
classes form a monoid with multiplication well-defined by [ii] = [f]= = [^11]=; 
this is called the monoid presented by the presentation. 

The word problem for a (fixed) monoid presentation {A \ R) is the algo- 
rithmic problem of, given as input two words u,v G A*, deciding whether 
u =/j V. The uniform word problem for finitely presented monoids is the 
algorithmic problem of, given as input a monoid presentation {A \ R) and 
two words u,v € A*, deciding whether u =/? v. It is well-known that there 
exist finite monoid presentations which the word problem is undecidable, 
and hence that the uniform word problem for finitely presented monoids is 
undecidable |12l [T^ . More generally, if ^ is a class of finite monoid presen- 
tations, then the uniform word problem for ^ monoids is the algorithmic 
problem of, given as input a monoid presentation {A \ R) in ^ and two 
words u,v £ A*, deciding whether u =r v. 

Now suppose we have a fixed monoid presentation {A \ R). A relation 
word is a word which appears as one side of a relation in R. A piece is a 
word which appears more than once as a factor in the relations, either as 
a factor of two different relation words, or as a factor of the same relation 
word in two different (but possibly overlapping) places. Let TTi G N be a 
positive integer. The presentation is said to satisfy C{m) if no relation word 
can be written as a product of strictly fewer than m pieces. Thus C(l) says 
that no relation word is empty; C(2) says that no relation word is a factor 
of another. 

Definitions corresponding to all of those above can also be made for semi- 
groups (without necessarily an identity element), by taking A'^ in place of 
A* (in all places except the definition of one-step equivalence, where a and 
b must still be allowed to be empty). 

Now fix an alphabet A. To study generic properties of /c-relation presenta- 
tions over A, we need a stratification on the (countable) set of all such. There 
are two obvious ways to define the size of a presentation, and hence two nat- 
ural stratifications of the ^-generated /c-relation presentations. Firstly, one 
can take the size of the presentation to be the sum, length of the relation 
words; this gives rise to the sum length stratification of presentations. Al- 
ternatively, one can define the size to be the length of the longest relation 
word; this results in the maximum relation stratification. Which choice is 
most natural depends on the application. For example, the sum length 
of a presentation is a good approximation to the space required to encode 



9 
The reader may think it more natural to consider a set of unordered pairs, but the 

definition we use simplifies the combinatorics in our analysis, and Theorem [S] will show 

that it makes no difference to the end results. 
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the presentation in the obvious way, and hence for computational apphca- 
tions seems the most natural. Intuitively, the sum length stratification lends 
greater weight to uneven distributions of the relation word lengths within a 
presentation; in particular, it results in a greater frequency of short words, 
which makes it seem less likely that small overlap conditions will hold. Nev- 
ertheless, it transpires that our main results hold for both stratifications, 
which may be regarded as some evidence of their "robustness" . 

We emphasise that we are attempting here to stratify only the set of A- 
generated, /c-relation semigroup presentations, where the alphabet A and 
set of relations k are fixed. There are, of course, also natural stratifications 
across all ^-generated semigroup presentations, allowing the number of re- 
lations to vary. These typically lead to a high frequency of "short" relation 
words, which means that small overlap type conditions do not hold generi- 
cally. However, it seems likely that, for at least some natural stratifications 
of this type, the word problem remains generically solvable for other reasons. 
This interesting issue will be studied further in a subsequent paper. 

We shall need a couple of elementary definitions from combinatorics. Let 
n and k be non-negative integers. Recall that a composition of n into k is an 
ordered /c-tuple of positive integers which sum to n, while a weak composition 
of n into k is an ordered fe-tuple of non-negative integers which sum to n. 

Having fixed the alphabet A, a fc-relation monoid presentation of sum 
length n is uniquely determined by its sequence of relation words; this in 
turn is uniquely determined by the concatenation in order of those words (a 
word in A^) and the lengths of those words (a weak composition of n into 2k, 
called the shape of the presentation). Thus, fc-relation monoid presentations 
of sum relation length n are in a bijective correspondence with ordered pairs 
whose first component is a word of length n, and whose second component 
is a weak composition of n into 2k. 

We shall need the following simple combinatorial lemma. 

Lemma 1. Let A he a finite alphabet and c and p be positive integers. The 
number of distinct words of length c which admit factorisations as xivyi and 
as X2vy2 for some xi,X2,yi,y2,v £ A* with \v\ > p and xi / X2 is bounded 
above by c'^\A\'^~p. 

Proof. Clearly if a word admits such factorisations, then it admits such 
factorisations with |t;| = p, so we may count only those words which admit 
such factorisations with \v\ = p. 

We claim, having fixed A, c and p, any such word is uniquely determined 
by xi, yi and the length of X2. Clearly, there are fewer than c^ ways to 
choose the lengths of xi and X2; doing so also fixes the length of yi, since 
we must have 

\xi\ + \v\ + \yi\ = \xi\ +P+ \yi\ = c. 
Now there are at most 

M||a:i| + |?/i| _ M|c-bl _ M|c-P 

ways to choose the words xi and yi with the given lengths, so proving the 
claim will suffice to prove the lemma. 

Since xi and X2 are distinct prefixes of the same word, their lengths cannot 
be equal. Suppose first that xi is longer than X2 and write v = v^^' . . . v^'""'' 
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and xi = x\ . . . x-^ with each v^^' and x^^' in A. Then since xivyi 
X2vy2 we have 



t;« 



for 1 < i < |xi| — |x2| 

y{l~'\xi\ + \x2\) £qj. I^;^! _ |2;2| < i < |f I 



from which the claim follows. 

If, on the other hand, xi is shorter than X2 then we use the lengths of v 
and X2 to deduce the length of y2, whereupon a symmetric argument suffices 
to complete the proof. D 

Proposition 1. Let A be a finite alphabet, and n and r be positive integers, 
and fix a weak composition a of n (into any number). Then the proportion 
of presentations of shape a which have a piece of length r or more is bounded 
above by n'^\A\~^ . 

Proof. The set of presentations over A of shape a is in 1:1 correspondance 
with the set A" via the map which takes each presentation to the concate- 
nation, in the obvious order, of its relation words. If the presentation has 
a piece of length r or more then the corresponding word will feature that 
piece as a factor in at least two different places. By Lemma [1] it follows that 
the number of presentations with a piece of length r or more is bounded 
above by n^|^|"~^. The total number of such presentations in |A|", so the 
proportion of presentations with the desired property is bounded above by 
n^l^l"'' as required. D 

Corollary 1. Let A be a finite alphabet and k, n, m and K be positive 
integers with m > 2, and fix an weak composition a of n into 2k such 
that no block has size less than. Then the proportion of presentations with 
alphabet A and shape a which do not satisfy C{m) is bounded above by 

n^ 



|^|i^/(m-l) • 

Proof. If a presentation fails to satisfy C{m) then some relation word can 
be written as a product of m — 1 pieces. By assumption this relation word 
must have length at least K, so one of the pieces must have length at least 
K/{m — 1). The result is now immediately from Proposition [TJ D 

Before proving the first of our main theorems, we will need an elementary 
combinatorial result concerning weak compositions; this will serve to bound 
the proportion of presentations which feature a "short" relation word. 

Lemma 2. Let k be an integer, and f : N ^ N be a function such that 
f{n)/n tends to zero as n tends to infinity. Then the proportion of weak 
compositions of n into k which feature a block of size f{n) or less tends to 
zero as n tends to infinity. 

Proof. It is well-known and easy to prove (see, for example, [U Theorem 5.2]) 
that the number of weak compositions of n into k is given by 

_ jn + k-iy. 

^^^''^ " n!(A:-l)! 



GENERIC MONOIDS AND SEMIGROUPS 9 

Clearly, every partition of n into k featuring a block of size f{n) or less can be 
obtained by refining a partition of n into k — 1, with the extra decomposition 
in one of k{f(n) + 1) places. Thus, the number of such partitions is bounded 
above by 

k (fin) + 1) C',_,{n) = k (fin) + 1) ^^,^^ _ ^^j' 

Hence, the proportion of such partitions amongst all weak compositions of 
n into k is bounded above by 

k (/(n) + l) gfc_i(n) _ k (/(n) + l) {n + k - 2)\ n\ {k - 1)1 
q(n) ~ {n + k-iy. n! (fe-2)! 

^ fc(fc-l) (/(n) + l) 
n + k — 1 



n + k — 1 n + k — 1 

which clearly tends to zero as n tends to infinity. D 

We are now ready to prove our main theorem for the sum relation length 
stratification. 

Theorem 1. Let A he an alphabet of size at least 2, and k and m he posi- 
tive integers. Then the set of A-generated, k-relation monoid presentations 
which satisfy the condition C{m) is generic with respect to the sum length 
stratification. 

Proof. Since C(2) implies C(l), we may clearly assume without loss of gen- 
erality that m > 2. We need to show that the proportion of A-generated 
/c-relation monoid presentations of length n which fail to satisfy C{m) tends 
to zero as n tends to infinity. 

For each re, let Pn be the set of all weak compositions of n into A;, let 
Qn be the set of weak compositions of n into k featuring a block of size 
3(rre— 1) logiy^i re or less, and let i?„ = Pn\Qn- By an application of LemmaO 
with the function / : N ^ N given by /(n) = 3{m — 1) logi^i re, we see that 
the proportion |Qn|/|-fn| tends to as n tends to infinity. 

For each weak composition a, let Xa be the proportion of presentations 
of shape a which fail to satisfy C{m). Note that by Corollary [T] we have 

re^ 
x„ < 



MK„/{m-l) 



where K^^ denotes the smallest block size in a. For each fixed re, there are 
clearly equally many (|A|" to be precise) presentations of each shape, so the 
proportion of presentations of length re failing to satisfy C{m) is just the 
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average over shapes a oi x^, that is: 






r? 



Pn\ \ "^ \Pn\\ ^^ M|i^a/(m-l) 



\Qn\ , 1 ( Y^ ^ 



+ D^ E 



2 



IP I IP I \ ^-^ \A\!<a/{m-l) / ■ 

\crGR„ / 



We have aheady observed that |Qn|/|-fn| tends to zero as n tends to infinity. 
Moreover, by the definition of i?„ we have K^j > 3{m — l)\og\Mn for all 
a S Rn so that 



O-^Rn 



^ X<,/(m-l) 


< 


1 


14 (3('"-l)log|A|'^)/('"-l) 




= 


\Rn\ 


r? 




\Pn\ 


|^|(3(m-l)log|^|n)/(m-l) 






\Rn\ 


n2 






\Pn\ 


14 l°glAl("^) 




< 


2 

n 





which tends to zero as required. D 

An analysis of the proof shows, approximately speaking, that the propor- 
tion of presentations of A failing to satisfy any given small overlap condition 
goes to zero like (logiy^i n)/n, which for practical purposes may be rather 
slow. The barrier to showing a faster convergence is the proportion of pre- 
sentations featuring a "short" relation word (IQnI/l-Pnl in the notation of the 
proof); this proportion really does seem to decrease very slowly, suggesting 
that for the sum length stratification, fast convergence to small overlap con- 
ditions is not possible. To obtain statements about the "superpolynomially 
generic monoid" or "exponentially generic monoid" with respect to the sum 
length stratification, one would require arguments which take detailed ac- 
count of the "short" relation words. 

Our next task is to prove that an equivalent result holds for the maximum 
length stratification. We begin with an analogue of Lemma [21 which will 
show that the frequency of presentations featuring a "small" relation word 
is again negligible. This time, because the number of presentations of each 
shape of maxmimum length k is not fixed, we must reason directly with 
presentations rather than just shapes. Having taken account of this, the 
result is easier and, as one might expect given our remarks above on the 
relative frequency of "short" relation words in this stratification, stronger. 

Lemma 3. Let A he an alphabet of size at least 2, k be a non-negative in- 
teger, and / : N ^ N 6e a function such that n — f{n) tends to infinity as 
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n tends to infinity. Then the proportion of A- generated k-relation presen- 
tations of maximum relation word length n which feature a relation word of 
length f{n) or less tends to zero as n tends to infinity. Moreover, if there 
exists a constant p > such that n — f{n) > pn for sufficiently large n then 
the given proportion tends to zero exponentially fast. 

Proof. Let X^ be the set of all presentations over A of maximum relation 
length n, let Y^ be the presentations in X„ which have a relation word of 
length /(n) or less, and let Z„ = X„ \ y„. The quantity we seek is thus the 
limit as n tends to infinity of |1^|/|X„|. Let / = {1, . . . , 2fc} and define a 
map a from I x X^ to the set of all presentations /c-relation presentations 
of A, which takes (i, P) to the presentation obtained from P by removing 
n — f{n) characters from the end of the ith relation word, or replacing this 
relation word with the empty word if its length is less than n — f{n). 

We claim that under the map a, every presentation in Yn has at least 
|^|"--/(") pre-images in I x X„. Indeed, if Q 6 Y^ then Q has some relation 
word (say the jth) of length less than /(n), say length p. Now for each of 
|^|"-/(") words w € ^"~-^("') we can obtain from Q a presentation P^ € X^ 
by appending w to the end of the jth relation word, and it is easily seen 
cO> Pyj) = Q for all such w. 

Thus, we have 2k\Xn\ = \I x Xn\ > |^|"-^(")|y„|, and so 



\Xn\ ~ |A|"-/W 

Since n — f[n) tends to infinity with n, this clearly tends to zero. If moreover 
p > is such that n — /(n) > pn for n sufficiently large then we have 



\Xn\ ~ \AY"^ 

SO that the given quantity tends to zero exponentially fast. D 

Corollary 2. Let A he an alphabet of size at least 2, k be a non-negative in- 
teger, and c a constant with < c < 1. Then the proportion of A- generated, 
k-relation presentations of maximum relation word length n which feature 
a relation word of length en tends to zero exponentially fast as n tends to 
infinity. 

Proof. Define / : N — > N by f{n) = en, and choose p with 

< p < 1 - c. 

Then n — f{n) = (1 — c)n > pn for all n, so the result follows from Lemma[3l 

D 

We are now ready to prove our main result for the maximum length 
stratification. 

Theorem 2. Let A be an alphabet of size at least 2, and let k and m be pos- 
itive integers. Then the set of A-generated, k-relation monoid presentations 
which satisfy C{m) is exponentially generic with respect to the maximum 
length stratification. 
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Proof. The structure of the proof is essentiahy the same as that for Theo- 
rem [21 but it is shghtly comphcated by the fact that the number of presen- 
tations of each shape for a given maximum relation word n is not fixed. In 
addition, we must to show that the rate of convergence is exponential. Once 
again, we assume without loss of generality that m >2. 

Let Cn be the total number of presentations over A of maximum relation 
word length n. Let P„ be the set of all weak compositions of any integer into 
2k with largest block size n. Choose d with < d < 1 and let Qn be the set 
of all shapes in Pn with a word of length dn or less. Let Rn = Pn\ Qn- For 
each weak composition a (z Pn, let Cq- be the total number of presentations 
of shape a, and let x^ be the proportion of presentations of shape a which 
fail to satisfy C{m). For each shape o", by Corollary [T] we have 

X < ^"-)' 

- \A\K./{m-l) 

where n^j is the total size of a (that is, the sum of the block sizes of o", or 
the sum relation word length of a presentation of shape a), and K^j is the 
smallest block size in a. But a has 2k blocks, none of which is larger than 
n, so we must have n^j < 2kn, so that 

(2A;n)2 _ A k"^ n^ 
Now the proportion we seek is given by 

"■ \fTGP„ / " \aeQn I \aeR„ / 




The first term in the last line is the proportion of presentations featuring 
a relation word of length dn or less; by Corollary [2l this tends to zero 
exponentially fast. Considering now the second term, by the definition of 
Rn we have that K^^ > dn for all a G i?,„ so that 



TT E^-- 



Ak'^n'^ 1 v-^ 4A;^n^ 



< ^ V c„ 



Cn ^ ''\A\KoKm-\) - Cn ^ '^ M |<in/(m-l) 



< 



|^|dn/(m-l)y \^ Cn 

Ak^n^ 



which since 1^41 > 2 and d > clearly tends to zero exponentially fast. D 

4. Equivalence of Stratifications 

It often happens that two stratifications (on the same set, or on related 
sets) are closely related, so that knowledge of the generic sets with respect to 
one yields corresponding information about the generic sets with respect to 
the other. In this section we establish some technical conditions under which 
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this holds, and use this to extend many of our earher results to additional 
natural stratifications. 

First, we consider the relationship between spherical and ascending strat- 
ifications. So far, we have seen examples only of spherical stratifications 
of instance spaces, but to each such stratification is associated an equally 
natural ascending stratification, the sets in the latter being unions of the 
sets in the former. The following proposition, which was first observed in 
[1] to be an easy consequence of the Stolz-Cesaro Theorem, says that the 
generic sets are independent of which of these stratifications is used (see [1] 
for a more detailed explanation). 

Proposition 2. [U Lemma 3.2] Let 5„ be a spherical stratification of a set 
S. Define a new stratification on S by 

n 
^n = IJ Sj. 

i=i 

Then any set X (^ S is generic with respect to the stratification Sn if and 
only if it is generic with respect to the stratification Bn ■ 

We shall need the following elementary proposition, which essentially says 
that the restriction of a stratification to a generic set preserves generic sets. 

Lemma 4. Let X be a stratified set, and X' a generic subset of X. Then 
for any P (^ X we have 

,. |Pnx„| ^. \PnXnnx'\ 

lim — — — - — = lim 



n^oo \Xr,\ n^oo |X„nX'| 



Proof. First notice that, since X' is generic, we have 

,.^ \pnxn(x\x')\ ^ ^^ \(x\r)nx„\ ^ „ 

Now 

IPnXnHX'l , \PriX„r\X'\ \x„\ 

lim — r^T TT7. — = lim 

n— »oo 



\Xr,.nX'\ n-.oo \Xn\ |X„nX'| 

^. \Pr\Xnr\X'\\ / . \x„nx'\ 

lim — — lim 



71— »00 



Xr,\ I \n— >oo \X, 



lim — — 1 (since A is generic) 



n— >oo \Xn 



lim \IS1^I^\ + 

lim £M^) + ( lim l^"^";;f\^'' ) (by © 

\Pr\Xnr\X'\ \Pr\Xnr\ix\x') 

lim rr^. + rr^. 

n^oo |A„| \^n\ 

,. \Pr\Xn\ 

^™ ly I 



as required. D 
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Next, we introduce a very useful sufficient condition for a map between 
stratified sets to preserve generic sets. To do so, we need some terminology. 
Let X and Y be stratified sets, X' C X and Y' C y, and f : X' ^^Y' Si map. 
Then / is called stratification-preserving if for every x & X' and n € N we 
have X € Xn if and only if f{x) &Yn. li P C X then / is said to respect P if 
f{P n X') and /((X \P)n X') are disjoint, that is, if whenever xi, X2 € X' 
are such that /(xi) = /(X2) we have either xi, 2:2 G P or xi,X2 ^ P. Recall 
that the /i6re size of / at a point y € Y' is the cardinality of the set of 
elements x G X' such that f{x) = y. The map / is called bounded-to-one if 
there is a finite upper bound on its fibre sizes. 

Theorem 3. Let X and Y be stratified sets, X' Q X and Y' QY be generic 
subsets of X and Y respectively, d G N and f : X' ^ Y' a surjective, 
stratification-preserving map, such that for every n G N there exists A;„ G N 
such that the fibre sizes of f at points in X„ Pi X' all lie between kn and dkn ■ 
Then for any set P C X we have 

(i) 

1, \f(Pnx')nYn\ , IPnxJ ,, |/(Pnx')ny„| 

- lim ^ --4 — < lim ' ,„ , ' < d lim ''' ^ ' "' 



d n^oo |y„| n-*oo \X„\ n^oo |y„| 

wherever both limits are defined; 
(ii) 

1, |Pnx„| , \f(Pr\X')r\Yn\ ,, \Pr\Xn\ 

hm — -— -; — < lim — — -; < a lim 



d n^oo \Xn\ n— >oo |1^| n^co \Xn\ 

wherever both limits are defined; 
(iii) P is negligible in X if and only if f{P n X') is negligible in Y; 
(iv) // P is generic in X then f{P D X') is generic in Y; 
(v) If d = 1 and f{P fl X') is generic in Y then P is generic in X; and 
(vi) // / respects P and f{P fl X') is generic in Y then P is generic in 

X. 

Before proving Theorem [3l we emphasise that parts (i) and (ii) do not 
guarantee that one of the limits involved is defined exactly if the other is 
defined. If one of the sequences converges to some value c, then only in the 
case c = can we be certain that the other will converge. If c 7^ then 
the other may fail to converge, although one can easily show that it will 
eventually be constrained to vary within the range [d~^c, dc]. We now turn 
to proving Theorem [3l 

Proof. By the bounds on the fibre sizes of / we clearly have 

\f{Pr\X'r\Xn)\ < \Pr\ x'nXnl < d\f(P n x' nXn)\ 

and 

\f{x'nXn)\ < \x'nXn\ < d\f{x'nXn)\ 

for all n G N. It follows from the fact that / is surjective and stratification- 
preserving that fix'nXn) = Y'nYn and /(Pnx'nx^) = /(Pnx')nF„, 
so the above inequalities become 

|/(Pnx')ny„| < \Pnx'nXn\ < d|/(Pnx')ny„| 
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and 

\Y'nYn\ < |x'nx„| < d\Y'nYn\ 

respectively. Now combining these yields 

i \f{Pnx')nYn\ ^ iPnx^nx'l ^ \f{Pnx')nYn\ 

d \Y'r\Yn\ - \XnnX'\ - \Y'r\Yn\ ■ ^ ^ 

It follows also that 

i|Pnx„nx'| \f{Pr\X')r\Yn\ ,|Pnx„nx'l 

_l n 1 ^ \J_\ )_ nl ^ ^1 n 1 ,^. 

d \Xnr^X'\ - \Ynr^Y'\ - |x„nx'| ^ ' 

where the left-hand [respectively, right-hand] inequality is obtained by di- 
viding [multiplying] both sides of the right-hand [left-hand] inequality in ([3]) 
by d. 

Now since X' and Y' are generic in X and Y respectively, Lemma H] gives 

,. jPnx^l ^. \Pr\X'r^Xn\ 

lim — — -— ; — = hm 



and 

,. |/(Pnx')ny„| ,. |/(Pnx')ny„ny'| ^. |/(Pnx')ny„ 

lim — — = lim — — = lim 



n-^oo \Yn\ n^oo [!"„ H Y' | n^oo [!"„ Pi T' | 

where the second equality on the second line holds because f{Pr\X') C Y' . 
It is now clear that parts (i) and (ii) follow from ([3]) and ([!]) respectively. 

If f{Pr) X') is negligible in Y then the left and right-hand sides of (i) 
converge to 0, from which it follows that the middle expression converges 
to 0, and so P is negligible. Conversely, if P is negligible then exactly the 
same argument applies with (ii) in place of (i) to show that f{P n X') is 
negligible. This proves part (iii). 

If P is generic in X then X \ P is negligible in X, so by part (iii), f{{X \ 
P) n X') is negligible in Y. But by surjectivity, we must have 

Y'\fiPnx')Qfi{x\P)nx') 

so that Y'\f{PriX') is negligible in Y. Since Y' is generic in Y and generic 
sets are closed under intersection, it follows that 

Y\fiPn X') = (Y' \ f{p n X')) u (y \ y') 

is negligible in y, so that f{Pr\X') is generic in Y as required to prove part 
(iv). 

If d = 1 and f{P n X') is generic in Y then it is immediate from part (i) 
that P is generic in X, so that part (v) holds. 

Finally, suppose that / respects P and that f[P n X') is generic in Y . 
Since / is surjective we have 

y = f{x') = fi{x\P)nx')uf{Pnx'). 

Now since / respects P, we know that f{{X \P)n X') and f{P D X') are 
disjoint, and since Y' is generic in Y is follows that 

f{{x\P)nx') = Y'\fiPnx') 

is negligible in Y. But now by part (iii), we deduce that X \P is negligible 
in X, and hence that P is generic in X, as required to prove part (vi). D 

A particularly useful special case is the following immediate corollary. 
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Corollary 3. Let X and Y be stratified sets, X' (1 X and Y' (^Y be generic 
subsets of X and Y respectively, f : X' ^ Y' a surjective, stratification- 
preserving, bounded-to-one map. Then for any P Q X such that f respects 
P, we have that P is generic [respectively, negligible] in X if and only if 
f{P n X') is generic [negligible] in Y . 

Next, we apply Theorem E] to show that the generic properties of finitely 
presented semigroups are essentially governed by those of finitely presented 
monoids. Recall that if S* is a semigroup then S^ denotes the monoid with 
set of elements 5U{1} where 1 is a new symbol not in S, and multiplication 
defined by 



st 



the 5-product st if s, t € S; 
s if t = 1; 

t if s = 1. 



Theorem 4. Let 'W be a class of monoids, A a finite alphabet and A; G N. 
Then the generic A-generated k-relation monoid (with respect to either the 
sum length stratification or the maximum length stratification) belongs to ^ 
if and only if the generic A-generated k-relation semigroup S (with respect 
to the corresponding stratification) is such that S^ belongs to 'to. 

Proof. Let X and Y be the sets of /c-relation monoid and semigroup pre- 
sentations respectively over A. Suppose X and Y are equipped with either 
the sum length or the maximum length stratification. Let P be the set of 
presentations in X such that the monoid presented lies in ^, and let Q be 
the set of presentations in Y such that the semigroup S presented is such 
that S^ lies in ^. 

Let Y' = Y and let X' = Y (^ X be the set of semigroup presentations 
viewed as a subset of the set of monoid presentations, that is, those monoid 
presentations in which no relation word is empty. By Lemma [2] (for the sum 
length stratification) or Lemma [3] (for the maximum length stratification) 
X' is generic in X, and obviously Y' = Y is generic in Y. 

Define f : X' = Y ^ Y' = Y to he the identity function. Then / is 
1 : 1, surjective onto Y', and preserves the sum length and maximum length 
stratifications. Letting d = 1 and A;„ = 1 for all n, we see that the conditions 
of Theorem [3] are satisfied, so P is generic in X if and only if f{P n X') is 
generic in Y. 

Since / is the identity function on X', a semigroup presentation ^ € 
f{Pr]X') exactly if ^ interpreted as a monoid presentation lies in P. Since 
^ has no empty relation words, it is easy to see that the monoid presented 
by J^ is isomorphic to S^, where S is the semigroup presented by J^. Thus, 
^ e f{Pr\ X') if and only if S^ G "T, that is, if and only if ^ e Q. Hence, 
f{P n X') = Q, and so P is generic in X if and only if Q is generic in Y, as 
required. D 

Corollary 4. For every m > 1, A; € N and alphabet A of size at least 2, 
the generic A-generated k-relation semigroup (with respect to either the sum 
length stratification or the maximum length stratification) satisfies the small 
overlap condition C{rn). 
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An unordered monoid presentation consists of a set A of generators and an 
(unordered) set R of relations, each of which is an unordered pair of words 
from A* . Equivalence of words is defined exactly as for ordered presentations 
(see Section [3]), as are the sum length and maximum length stratifications on 
the sets of j4-generated presentations with some fixed number k of relations. 
There is an obvious map from the ordered to the unordered presentations 
over a given alphabet A, which simply "forgets" the ordering of the relations 
and the ordering of the pair of words in each relation, and discards any 
"duplicate" relations. Unordered semigroup presentations can of course be 
defined analogously. 

Theorem 5. Let '^ be a class of monoids, A an alphabet and k a non- 
negative integer. Then the generic [negligible] A-generated k-relation monoid 
(with respect to either the sum length stratification or the maximum length 
stratification) belongs to "^ if and only if the generic [respectively negligible] 
a-generator k-relation unordered monoid (with respect to the corresponding 
stratification) belongs to '^ . The corresponding statement for semigroups 
also holds. 



Proof. We prove the result for monoids; that for semigroups can be proved 
in exactly the same way. Let X be the set of ordered A;-relation monoid 
presentations over A, and Y the set of unordered /c-relation monoid presen- 
tations over A. Let P Q X and Q '^ Y he the sets of presentations in X 
and Y respectively such that the monoid presentated belongs to '^. 

Let X' C X be the set of ordered presentations which do not feature 
the same relation twice, or two relations of the form (u, v) and {v, u) for 
some distinct words u and v. We have seen that C(2) presentations do 
not feature the same relation word twice, so X' certainly contains all the 
C(2) presentations. It follows by Theorem [1] (for the sum relation length 
stratification) or Theorem [2] (for the maximum relation length stratification) 
that X' is generic in X. Let Y' = Y; then certainly Y' is generic in Y. 

Define f : X' ^> Y' = Y to he the restriction to X' of the obvious 
map described above from ordered to unordered presentations. It is clear 
from the definition of X' that / preserves the number of relations in the 
presentation and so really does define a map to Y, and moreover that this 
map is surjective. Since / takes each ordered presentation to an unordered 
presentation of the same monoid, it is also obvious that / respects P and 
maps P nX' onto Q. It is easily seen that / preserves both the sum length 
and the maximum length stratifications. Moreover, / clearly has fibre size 
bounded above by k\2 . It follows that the conditions of Corollary [3] are 
satisfied, so that P is generic in X if and only if f{P) = Q is generic in 

Y. n 



We thus allow ourselves to speak of a generic monoid or generic semigroup, 
without worrying about whether the presentation is defined to have a set or 
a sequence of relations. 



18 generic monoids and semigroups 

5. Properties of Generic Monoids and Semigroups 

In this section we explore some of the consequences of our results for 
generic monoids and semigroups. Recall that a monoid or semigroup is called 
^ -trivial if distinct elements always generate distinct principal ideals. 

Proposition 3. Any C(3) semigroup or monoid is torsion-free and ^ - 
trivial. 

Proof. Let 5 be a semigroup or monoid with a C(3) presentation {A \ R). 
By a result of Remmers [15J, only finitely many words over the alphabet A 
represent the same element of S. 

Suppose first that S it is not ^-trivial, and choose a,b £ S he distinct 
elements generating the same ideal. Then in particular, a is in the ideal 
generated by b, so we have a = pbq for some p,q £ S. But also b is in 
the ideal generated by a, so that and b = ras = rpbqs for some r,s G S. 
Now choose words b,p,q,f,s G A* representing b,p,q,r,s € S respectively. 
Certainly at least one of r and s is non-empty, since otherwise we would have 
r = s = 1 so that b = ras = a. But now it is easily seen that {fpyb{qsy 
represents b for every i > 0, contradicting Remmers' result. 

Similarly, suppose a G S" is non-identity torsion element. Then there 
is a non-empty word a € A representing a. But now it is easy to see 
that infinitely many powrs of a must represent the same element, again 
contradicting Remmers' result. D 

Combining with our theorem with have the following. 

Theorem 6. Let A be an alphabet of size at least 2 and let k be a posi- 
tive integer. Then the monoid defined by the generic A-generated k-relation 
presentation (with respect to either the sum length stratification or the max- 
imum length stratification) is non-trivial, torsion-free and ^-trivial. In 
particular, it is not a group, an inverse monoid or a regular monoid. The 
corresponding statements for semigroups also hold. 

Proof. By Theorem [U (respectively Theorem [2] for the other stratification) 
the generic A-generated /c-relation presentation satisfies C(3), and so by 
Proposition [3] the semigroup presented is torsion-free and ^-trivial. If it 
were trivial then every word over the alphabet would have to represent the 
identity, contradicting once more Remmers' result mentioned in the proof 
of the previous proposition. D 

By a recent result of the author, the uniform word problem for C(4) 
semigroups is solvable in time linear in the word lengths and polynomial in 
the presentation size fU Theorem 2]. Hence, we obtain 

Theorem 7. Let A be an alphabet of size at least 2 and let k be a positive 
integer. Then the generic A-generated k-relation presentation (with respect 
to either the sum length stratification or the maximum length stratification) 
has word problem solvable in linear time. The corresponding statement for 
semigroups also holds. 

Since there is also an algorithm to decide, in (worst-case) polynomial time 
whether a given presentation satisfies the condition C(4) [8^ Corollary 5], 
we also obtain 
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Theorem 8. Let A he an alphabet of size at least 2 and k he a positive 
integer. Then the uniform word problem for A-generated, k-relation monoid 
presentations is generically solvable in polynom,ial time. The corresponding 
statement for semigroups also holds. 

Further work of the author [9] has established a number of automata- 
theoretic properties of monoids which admit finite presentations satisfying 
the condition C(4). It follows from Theorems [1] and [2] that the "generic" 
monoid and semigroup will enjoy all these properties. The following theorem 
summarises these properties; for brevity we omit definitions of terms; which 
can be found in [9]. 

Theorem 9. Let A be an alphabet of size at least 2 and let k he a posi- 
tive integer. Then the monoid defined by the generic A-generated k-relation 
presentation (with respect to either the sum length stratification or the max- 
imum length stratification) is rational in the sense of [18j, asynchronous 
automatic and word hyperbolic in the sense of [3j . It also satisfies an ana- 
logue of Kleene 's theorem and has a boolean algebra of rational subsets and 
decidable rational subset membership problem. 
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